arXiv:1507.03767v2 [cs.SE] 17 Sep 2015 


Understanding the Affect of Developers: Theoretical 
Background and Guidelines for Psychoempirical Software 

Engineering 


Daniel Graziotin 
Faculty of Computer Science, 
Free University of 
Bozen-Bolzano 
Piazza Domenicani 3 
Bolzano/Bozen, Italy 

daniel.graziotin@unibz.it 


Xiaofeng Wang 
Faculty of Computer Science, 
Free University of 
Bozen-Bolzano 
Piazza Domenicani 3 
Bolzano/Bozen, Italy 

xiaofeng.wang@unibz.it 


Pekka Abrahamsson 
Department of computer and 
information science 
Norwegian University of 
Science and Technology 
NO-7491 Trondheim, Norway 
pekkaa@ntnu.no 


ABSTRACT 

Affects—emotions and moods—have an impact on cognitive 
processing activities and the working performance of indi¬ 
viduals. It has been established that software development 
tasks are undertaken through cognitive processing activities. 
Therefore, we have proposed to employ psychology theory 
and measurements in software engineering (SE) research. 
We have called it “psychoempirical software engineering”. 
However, we found out that existing SE research has often 
fallen into misconceptions about the affect of developers, 
lacking in background theory and how to successfully employ 
psychological measurements in studies. The contribution of 
this paper is threefold. (1) It highlights the challenges to 
conduct proper affect-related studies with psychology; (2) it 
provides a comprehensive literature review in affect theory; 
and (3) it proposes guidelines for conducting psychoempirical 
software engineering. 

Categories and Subject Descriptors 

D.2.9 [Software Engineering]: Management— Productiv¬ 
ity, Programming Teams', H.1.2 [Models and Principles]: 
User/Machine Systems— Human factors, Software psychol¬ 
ogy, J.4 [Social and behavioral Science]: [Psychology] 

Keywords 

Affects, emotions, moods, human aspects in software develop¬ 
ment, psychology of programming, psychoempirical software 
engineering 


1. INTRODUCTION 

The Nobel prize winner Daniel Kahneman has pointed out 
that it is unrealistic to limit our understanding of human 


behaviors solely through rational models |40| . Yet, soft¬ 
ware engineering (SE) research has been known to be too 
much confined in the fallacy of rationality-above-everything 
paradigm [62| , to miss out the possibility to be a social 
discipline |81| , and to focus too much on domains of tech¬ 
nical nature while neglecting the so-called soft aspects or 
human-related topics [51| . But software development is a 
very human activity. Software development happens in our 
minds first, then on artifacts [25| . It has been established 
that development is intellectual, and it is ca rried out through 
cognitive processing activities [M 25 41 . Indeed, we are 
human beings, and, as such, we behave based on affect as we 
encounter the world through our emotions and moods [10| . 
The affects pervade organizations by coloring the workers’ 
thoughts, and they influence their behavior |^. Affects have 
a role in the relationships between workers, deadlines, work 
motivation, sense-making, and human-resource processes 
1^. Although affects have been historically neglected in the 
studies of industrial and organizational psychology [60| , an 
interest in the role of affects on job outcomes has accelerated 
over the past fifteen years in psychology research [^. While 
research is still needed on the impact of affects on cognitive 
activities and work-related achievements in general, this link 
undeniably exists according to psychology research. 

We have shown elsewhere that practitioners are deeply 
interested in their affects while developing software, which 
causes them to engage in long and interesting discussions 
when reading related articles. Thus, it is important to under¬ 
stand the role of affects in software development processes. 
Even more, we share the view of Lenberg et al. that SE 
should also be studied from a behavioral perspective. We 
have, in fact, focused on these issues for some time now, by 
producing several articles on this avenue, i.e., 15 32 
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31 33 . We have also proposed the term psychoempiri¬ 


cal software engineering [34] to denote research in SE with 
proper theory and measurement from psychology. Our mes¬ 
sage was well-received by the community with some degree 
of agreements regarding terminology, e.g., [M . However, 
we show below that long is the road to properly address the 
human aspects of SE with psychology. 


Problem: SE Lacks in Theoretical Background of Af¬ 
fects and Guidelines for Using Psychology. 
















Given the rising number of recent SE articles that deal 
with the affects of developers, e.g., 27 36 19 , we believe 

that it should be important for researchers to adopt a critical 
view of the phenomenon under study, and that they do not 
fall into the several mis conceptions when dealing with the 
affect of developers [33| . 

Yet, we understand that we have placed ourselves in a “very 
confused and confusing field of study” ([^, p. 2). We expe¬ 
rienced this confusion especially during our talks at ISERN 
2014, where we chaired a workshop called psychoempirical 
SE [^, and during the CHASE 2015 workshop [^, where we 
presented some common misconceptions and measurements 
of the affect of software developers |^. Such misconceptions 
include confusing affect and the related constructs of emo¬ 
tions and moods with motivation or job satisfaction, which 
has happened even in articles already dealing with miscon¬ 
ceptions of motivation with respect to job satisfaction, e.g., 
128], although affects were not the focus of the study in this 


case. 

Other issues he in missing out the opportunity of using 
validated measurement instruments for affect. An example 
is the use of the niko-niko calendar for assessing the mood 
of a software development team, e.g. [76] , or the so-called 
happiness index, e.g., |55| . Another example of the missed 
opportunity is when a single truth is assumed in the writing 
of articles, like in a CACM positional article claiming that 
“psychologist recognize eight basic emotions, with each posi¬ 
tive balanced by a negative”, e.g. “love-hate” ( |18| , p. 34), 
or in a proper empirical study where it has been claimed that 
“there are six basic emotions or universal emotions: anger, 
happiness, fear, [..]” ([^, p. 1079). We will show below 
that it is not true that a unique dominant, accepted theory 
exists for affect, emotions, and moods. Researchers should 
recognize this issue when employing such delicate concepts 
for conducting research. 


Proposal: Theoretical Background of Affects and Guide¬ 
lines for Psychoempirical SE. 

While it would be preposterously arrogant on our side to 
claim the all-encompassing knowledge of the topic, we would 
like to share what we have learned so far in our journey to 
understanding software developers through their affect. This 
article builds upon our experience, the feedback collected 
at our talks and peer review processes, and the previously 
conducted research, to build some theoretical background 
for understanding the affect of software developers. We draw 
from research in psychology in the last decades, and offer 
a comprehensive review of the theory of affect (section 
and, as a follow-up of our ISERN 2014 workshop [34] , we 
propose our guidelines for psychoempirical SE (Section 
for conducting studies in SE with psychological theory and 
measurement. 


[.. . ] often used in situations where emotions dominate the 
person’s awareness” [^. This definition is problematic as it 
contains the term emotion, which has not yet been defined, 
and it does not help in defining the (now apparently) super¬ 
construct affects. Indeed, the term affects is often associated 
in the literature with emotions and moods. We now are left 
with three terms, which look remarkably similar to each 
other. 

Plutchik [65] has defined emotions as the states of mind 
that are raised by external stimuli and are directed toward 
the stimulus in the environment by which they are raised. 
However, Kleinginna et al. reported one year later that 
more than 90 definitions have been produced for this term, 
and no consensus in the literature has been reached. The 
term has been taken for granted and often defined with 
references to a list, e.g. anger, fear, joy, surprise |^. To 
worsen this, emotion as a term is not universally employed, as 
it is a word that does not exist in all languages and cultures 



Moods have been defined as emotional states in which the 
individual feels good or bad, and either likes or dislikes what 
is happening around him/her |^. Yet again, a definition of 
one construct contains another construct of our interest. 


How Do Emotions and Moods Differentiate, Then 7. 

While for some researchers certain moods are emotions 
and vice versa |17] , it has been suggested that a distinction 
is not necessary for studying cognitive responses that are not 
strictly connected to the origin of the mood or emotion [88] . 
Distinctions between emotions and moods are clouded, be¬ 
cause both may feel very much the same from the perspective 
of an individual e xpe riencing either and are now a part 
of common sense |71| . They are embedded in psychologists’ 
questions and, as a consequence, answers. Reisenzein [67] 
argued that “t/ie consensual definition of emotion is not a 
precondition but the result of scientific research; and even 
then, it remains a revisable empirical hypothesis" (p. 2). So, 
affects, emotions, and moods are an emergent construction 
rather than a latent entity 57 . 


We have adopted the same stance of several researchers 

and employed the noun 


in the various fields 178 77 87 16 


affects (affective states) as an umbrella term for emotions 
and moods. We will show that, according to a recent unifying 
theory, this strategy does make sense. 


2.1 The Major Frameworks for Affect Theo¬ 
ries 

According to Huang [38] , four major theories exist for 
emotions (moods, affects) in psychology. However, we see 
that these four theories and all the other we could review 
fall into two competing frameworks. 


2. AFFECT, EMOTIONS, MOODS: THEO¬ 
RETICAL BACKGROUND 

The fields of psychology have failed to agree on the def¬ 
initions of affects and the related terms such as emotions, 
moods, and feelings . Yet, it is desirable that we 

provide a starting set of definitions, which we will however 
criticize. 

Let us start by stating that the term affect (or affective 
state) has been defined as “any type of emotional state 


The Discrete Eramework. 

One framework, namely the discrete approach, collects a 
set of basic affective states that can be distinguished uniquely 
[65] , and that possess high cross-cultural agreement when 
evaluated by people in literate and preliterate cultures [22] . 

The Differential Emotions Theory [39] states that the hu¬ 
man motivation system is based on ten fundamental emotions 
(interest, joy, surprise, distress, anger, disgust, contempt, fear, 
shame, and guilt). These fundamental emotions function for 
the survival of human beings, possess an own neural network 

















in the brain, and an own behavioral response. Finally, these 
emotions interact with each other simultaneously. 

Ekman [22| proposed a set of basic affects, which include 
anger, happiness, surprise, disgust, sadness, and fear. How¬ 
ever, the list has received c riti que, leading to an extended 
version of eleven elements |23| . They include amusement, 
embarrassment, relief, and shame. 

In the Circular Model of Emotion [^, a structure describ¬ 
ing the interrelations among emotions has been proposed. 
Eight primary, bipolar affective states were presented as 
coupled pairs: joy versus sadness, anger versus fear, trust 
versus disgust, and surprise versus anticipation. These eight 
basic emotions vary in intensity and can be combined with 
each other, to form secondary emotions. For example, joy 
has been set as the midpoint between serenity and ecstasy, 
whereas sadness has been shown to be the midpoint between 
pensiveness and grief. Emotions can vary in intensity and 
persistence (to form moods, for example). Emotions, under 
this theory, serve an adaptive role in dealing with survival 
issues. 

Developing a minimal list of basic affective states appears 
to be difficult with the discrete approach. Subsequent studies 
have come to the point where more than 100 basic emotions 
have been proposed [79]. 


The Dimensional Framework. 

The dimensional framework groups affects in m ajor dim en- 
sions that allow a clear distinction among them . In 

the PAD models, three dimensions of Pleasure-displeasure, 
Arousal-nonarousal, and Dominance-submissiveness |74[ |69[ 
[ 5 ^ characterize the emotional states of humans. Valence (or 
pleasure) is the attr activene ss (or adverseness) of an event, 
object, or situation [52| [48| . The term refers to the “direc¬ 
tion of a behavioral activation associated toward (appetitive 
motivation) or away (aversive motivation) from a stimulus” 
46 . Arousal represents the intensity of emotional activation 
46 . It is the sensation of being mentally awake and reactive 


to stimuli, i.e. vigor and energy or fatigue and tiredness [89] . 
Dominance (or control, over-learning) represents a change 
in the sensation of the control of a situation [^. It is the 
sensation by which an individual’s skills are perceived to 
be higher than the challenge level for a task |14| . Pigure[^ 
provides a representation of a PAD model of valence and 
arousal, and examples of related discrete affects with an 
indication to where they might correspond on the axes. 

Emotional states under the PAD models include moods, 
feelings, and any other feeling-related concepts. The dimen¬ 
sions are usually bipolar, indicating that the presence of 
pleasure excludes the possibility of displeasure. Some vari¬ 
ations of these models have been proposed using different 
notations but without changing the core meaning [^, some 
of which omit the dominance dimension |46| . 

In the Positive and Negative Affect Schedule (PANAS) 
|85| |86| , the positive and negative affects are considered as 
the two primary emotional dimensions. However, these two 
dimensions are the result of the self-evaluation of a number 
of words and phrases that describe different feelings and 
emotions. That is: discrete emotions are rated but two 
dimensions are evaluated. This theory is designed to present 
a mood scale. Finally, positive and negative affects are 
mutually independent. In Figure]^ the positive (negative) 
dimension would comprise of positive (negative) valence, or 
positive arousal, or both according to the different theories. 
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Figure 1: A PAD model of valence and arousal, and 
examples of related discrete affects. 


We note here that several other theories exist, although 
they are less prominent. One example is the cube of emotion 
|54| , which is a dimensional theory of affect that expresses 
affect in terms of combinations of dopamine, adrenaline, and 
serotonin, which intersect in eight basic (but extreme) affects, 
e.g. distress, interest, joy. 

The Unifying Theory. 

A prominent unifying theory exists as well. Russell and 
Barrett |73| |71| |72| have proposed the concept of core affect 
to unify the theories of emotions and moods in psychology. 
Core affect is “a pre-conceptual primitive process, a neuro¬ 
physiological state, accessible to consciousness as a simple 
non-reflective feeling that is an integral blend of hedonic 
(valence) and arousal values: feeling good or bad, feeling 
lethargic or energized” (p. 147). The state is accessible 
at a consciousness level as the simplest raw feelings, which is 
distinct in moods and emotions. A feeling is an assessment 
of one’s current condition. Therefore, an affect is a very raw 
concept, upon which the more complex of mood and emotion 
is built upon. Pride can be thought of as feeling good about 
oneself. The “feeling good” is core affect and the “about 
oneself” is an additional (cognitive) component. 

Changes in affects result from a combination of happenings, 
such as stressful events on the job. Sometimes the cause of 
the change is obvious. However, sometimes one can undergo 
a change in core affect without understanding the reasons. 
The individuals possess a limited ability to track this complex 
causality connection. Instead, a person makes attributions 
and interpretations of core affect. 

Affect can be felt in relation to no obvious stimulus—in a 
free-floating form—as moods are perceived. Indeed, mood is 
dehned as a prolonged core affect without an object, i.e. an 
unattributed affect. 

In the core affect theory, emotions are episodes instead of 
simple objects. An emotion is a complex set of interrelated 
sub-events about a specific object. 

The core affect theory is interesting because it unihes the 
previous theories, and it maintains compatibility with the 






majority of the existing measurement instruments, regardless 
of them being about moods or emotions. Although we do 
not neglect moods and emotions per se, when adopting the 
core affect theory we chose to understand the states of minds 
of software developers at the affective level only, which is the 
foundation of moods and emotions. 


Core Affect Is our Current Suggestion to Frame SE Re¬ 
search on Affect. However, a researcher should select the 
affective framework and theory that better suits the research 
objective and the level of details that are desirable. We 
provide more details in the next section. What is important 
is that researchers are aware of an absence of an absolute 
truth and of the many existing alternatives, and that they 
justify their choice. 

3. GUIDELINES FOR PSYCHOEMPIRICAL 
SOFTWARE ENGINEERING 

A much requested feature in our previous discussions at 
recent academic venues such as ISERN 2014, CHASE 2015 
[^, and ICSE 2015 had been How should one conduct re¬ 
search with psychological measurements? By making sense 
of the hundreds of articles we reviewed on psychology and 
organizational behavior, we came up with a simple series of 
steps, listed below. 


Defining a Research Objective. 

As with any research activity, it is important to understand 
what we want to do in a study. Suppose two different, yet 
common scenarios with the affects of developersAThey have 
been adapted from two of our previous studies [M 

Scenario A Assessing how happy developers are generally. 

Scenario B Assessing over a time frame the emotional re¬ 
action of a stimulus (e.g., employing a software tool) 
on developers. 

Both of them require a deep understanding of the topic 
under study. 
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Theoretically Framing the Research. 

Scenario A —From a comprehensive literature review, we 
would understand that we can call happy those developers 
who are in a strongly positive mood, or those who frequently 
have positive and meaningful experiences (see for more), 
thus having a positive affect balance. We decide to focus on 
dimensions of affects, e.g. with the Positive and Negative 
Affect Schedule (PANAS) 86 , which still lets us evaluate 
discrete affects before the aggregated scores. 

Scenario B —Suppose that, instead of asking a developer 
what emotions she is feeling when using a tool, we are inter¬ 
ested in knowing how she feels in terms of more aggregated 
dimensions like pleasure, energy, and dominance. We focus 
then on the dimensional theory of affects like the one in the 
PAD models [74|[^[^. 


Selecting a Validated Measurement Instrument. 

Scenario A —The PANAS dimensional model recommend 
employing the PANAS 86 measurement instrument 
which is one of the most notable measurement instruments 
for affective states. However, a deeper look at the literature 
shows that there are several shortcomings that have been 


criticized for this instrument. The PANAS reportedly omits 
core emotions such as bad and joy while including items that 
are not considered emotions, like strong, alert, and determined 
[20[ |53| . Another limitation has been reported in its non¬ 
consideration of the differences i n desira bility of emotions 
and feelings in various cultures [M 53 . Furthermore, a 
considerable redundancy has been found in PANAS items 
|13[ |82| [^ . PANAS has also been reported to capture only 
high-arousal feelings in general [20] . 

Recent, modern scales have been proposed to reduce the 
number of the PANAS scale items and to overcome some of 
its shortcomings. Diener developed the Scale of Positive 
and Negative Experience (SPANE). SPANE assesses a broad 
range of pleasant and unpleasant emotions by asking the 
participants to report them in terms of their frequency during 
the last four weeks. It is a 12-items scale, divided into two 
sub-scales. Six items assess positive affective states and form 
the SPANE-P scale. The other six assess negative affective 
states and form the SPANE-N scale. The answers to the 
items are given on a five-point scale ranging from 1 [very 
rarely or never) to 5 (very often or always). For example, a 
score of five for the joyful item means that the respondent 
experienced this affective state very often or always during 
the last four weeks. The SPANE-P and SPANE-N scores 
are the sum of the scores given to their respective six items. 
Therefore, they range from 6 to 30. The two scores can be 
further combined by subtracting SPANE-N from SPANE-P, 
resulting in the Affect Balance Score (SPANE-B). SPANE-B 
is an indicator of the pleasant and unpleasant affective states 
caused by how often positive and negative affective states 
have been felt by the participant. SPANE-B ranges from 
-24 {completely negative) to -1-24 {completely positive). The 
SPANE measurement instrument has been reported to be 
capable of measuring positive and negative affective states 
regardless of their sources, arousal level or cultural context, 
and it captures feelings from the emotion circumplex |20[ 
|53| . The timespan of four weeks was chosen in SPANE in 
order to provide a balance between the sampling adequacy 
of feelings and the accuracy of memory [^, and to decrease 
the ambiguity of people’s understanding of the scale itself 
1 ^. 

Scenario B —The PAD dimensional models have been 
implemented in several measurement instruments. One of 
the most notable instruments is the Affect Grid |75| , which 
is a grid generated by intersecting the axes of valence and 
arousal accompanied by four discrete affects, i.e. depression- 
relaxation and stress-excitement, to guide the participant in 
pointing where the emotional reaction is located. The affect 
grid has been employed in SE research, e.g. in [?]. Yet, the 
grid was shown to have only moderate validity , thus other 
measurement instruments would be more desi rabl e. Thus 
comes the Self-Assessment Manikin (SAM, (^1^). SAM 
is a pictorial, i.e. non-verbal, assessment method. SAM 
measures valence, arousal, and dominance associated with 
a person’s affective reaction to an object (or a stimulus) [^. 
As a picture is worth a thousand words, we reproduce SAM 
in Hgure[^ The figures of the first row range from a frown to 
a smile, representing the valence dimension. The second row 
depicts a figure showing a serene, peaceful, or passionless 
face to an explosive, anxious, or excited face. It represents 
the arousal dimension. The third row ranges from a very 
little, insignificant figure to a ubiquitous, pervasive figure. It 
represents the dominance affective dimension. As reported 















Figure 2: The Self-Assessment Manikin. 


in |43| , SAM has the advantage of eliminating the cognitive 
processes associated with verbal measures but it is still very 
quick and simple to use. 


Considering Psychometric Properties. 

As we noted in a previous paper [33| , a selected mea¬ 
surement instrument has to possess acceptable validity and 
reliability properties, which are provided in psychometric 
studies of the measurement instrument. Psychometrics is a 
term, which has been misused in SE including ourselves. It 
is a subfield of psychology that focuses on the theory and 
techniques of psychological measurements. Psychometric 
studies deal with the design, development and especially the 
validation of psychological measures. 

A modification to an existing measurement instrument 
(e.g., adding, deleting, or rewording items) often requires a 
new psychometric study because the reliability of a measure¬ 
ment instrument can be compromised. Therefore, it is not 
advisable to modify validated psychological measurements 
or models as it happened in [12] . 

Scenario A —The SPANE has been validated to converge 
with other af fecti ve states measurement instruments, includ¬ 
ing PANAS |20| . The scale provided good psychometric 
properties in the introductory research and in numerous 
follow-ups, with up to twenty-one thousand participants in 
single study [M 
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53 . Additionally, the scale proved 


consistency across full-time workers and students |80| . 

Scenario B —The SAM has been under scholarly scrutiny, 
as well. The original article describing SAM already reports 
good psychometric properties . A very high correlation was 
found between the SAM items and those of other verbal-based 
measurement instruments |58| [59] , including high reliability 
across age [^. Therefore, SAM is one of the most reliable 
measurement instruments for affective reactions [43]. 


Administering the Measurement Instrument Correctly. 


The psychometric properties of a measurement instrument 
in psychology are also calculated by administering the in¬ 
strument in the same way in each study. This is because 
the instructions might influence the participants’ responses. 
For this reason, any good measurement instrument is always 
accompanied with the instructions for the participants. We 
encourage administering a measurement instrument as it is 
reported in the accompanying instructions, and to further 


share the instructions with participants. Furthermore, the 
gained transparency ensures a higher reproducibility of the 
studies. 

We strongly encourage the authors of SE studies to report 
the participants’ instructions when publishing an article, 
preferably in an archived format. 

Scenario A —The SPANE instructions for participants are 
clearly stated in the original paper and in the instrument 
itself, which is freely available. 

Scenario B —The SAM instructions for participants are 
exhaustively reported in the accompanying technical report 
0 . 

Performing Strong Analyses. 

We encourage the authors in SE to spend some time to 
understand whether such complex and delicate constructs 
require accurate analyses. 

Scenario A —The SPANE scores can be considered as 
ordinal values or as discrete pinpoints of a continuous scale. 
Regression analyses on the aggregated SPANE-P, SPANE-N, 
and SPANE-B scores are possible given that the assumptions 
for linear regression are met. Otherwise, especially when 
groups have to be compared, the usual assumptions for 
employing the t-test or non-parametric tests should be taken 
into account. It is also important to report an effect size 
measure such as the Cohen’s d. 

Scenario B —Repeated measures within-subject that need 
a between subject comparison pose several issues. First, 
there is not a stable and shared metric for assessing the 
affects across persons. For example, a score of one in valence 
for a person may be equal to a score of three for another 
person. However, a participant scoring two for valence at 
time t and five at time t+x unquestionably indicates that the 
participant’s valence increased. As stated by Hektner |^, “it 
is sensible to assume that there is a reasonable stable metric 
within persons” (p. 10). In order to have comparable mea¬ 
surements, the raw scores of each participant are typically 
transformed into z-scores (also known as standard scores). 
A z-score transformation is such that a participant’s mean 
score for a variable is zero, and scores for the same variable 
that lie one standard deviation above or below the mean have 
the value equivalent to their deviation. One observation is 
translated to how many standard deviations the observation 
itself is above or below the mean of the individual’s obser¬ 
vations. Therefore, the participants’ measurements become 
dimensionless and comparable with each other, because the 
z-scores indicate how much the values are spread |49| [^ . 

Second, the repeated measurements often present depen¬ 
dencies of data at the participants’ level and the time level 
grouped by the participant. The analysis of variance (ANOVA) 
family provides rANOVA as a variant for repeated measure¬ 
ments. However, rANOVA and general ANOVA procedures 
are discouraged |35| in favor of mixed-effects models, which 
are robust and specifically designed for repeated, within- 
participant longitudinal data |45| |35| [^. A linear mixed- 
effects model is a lin ear m odel that contains both fixed effects 
and random effects |68| . The estimation of the significance 

^ For the participants’ instructions in |30| , see https ://dx 
doi. org/10.77 17/peer 1.289/supp-1 Tor the participants' 
instructions in |29| |3^, see http://dx.doi.org/10.6084/ 
m9. f igshare. 796393' 

^ http://internal.psychology.Illinois.edu/~ediener/ 
SPANE.html 













of the effects for mixed models is an open debate [i] |66] . We 
encourage the reader to follow our reasoning in [32] for a 
deeper discussion. 

4. CONCLUSION 

Affects—emotions and moods—are beginning to be com¬ 
prehensively studied in SE, and other psychological con¬ 
structs are being incorporated in related research. However, 
there is a risk of underusing and misusing the theory and 
the measurement instruments from psychology, and falling 
into the many misconceptions tied to such intriguing and 
complex research topics. 

For this reason, we have proposed the term psychoempiri- 
cal software engineering to refer to the research in SE with 
psychology theory and measurement. This paper described 
the challenge to conduct proper affect-related studies with 
psychology, provided a comprehensive literature review in 
affect theory, and proposed guidelines for conducting psy- 
choempirical software engineering. 

With this article, we hope to raise much needed awareness 
for better use of psychology in SE studies and to begin a 
sane discussion with our peers towards a more standard and 
sound way of conducting studies on the human and social 
aspects of SE. 
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